智能论文笔记

Active Learning for Optimal Intervention Design in Causal Models

Jiaqi Zhang , Louis Cammarata , Chandler Squires , Themistoklis P. Sapsis , Caroline Uhler

分类：机器学习

2022-09-10

跨学科的一个重要问题是发现产生预期结果的干预措施。当可能的干预空间很大时，需要进行详尽的搜索，需要实验设计策略。在这种情况下，编码变量之间的因果关系以及因此对系统的影响，对于有效地确定理想的干预措施至关重要。我们开发了一种迭代因果方法来识别最佳干预措施，这是通过分布后平均值和所需目标平均值之间的差异来衡量的。我们制定了一种主动学习策略，该策略使用从不同干预措施中获得的样本来更新有关基本因果模型的信念，并确定对最佳干预措施最有用的样本，因此应在下一批中获得。该方法采用了因果模型的贝叶斯更新，并使用精心设计的，有因果关系的收购功能优先考虑干预措施。此采集函数以封闭形式进行评估，从而有效优化。理论上以信息理论界限和可证明的一致性结果在理论上基于理论上的算法。我们说明了综合数据和现实世界生物学数据的方法，即来自worturb-cite-seq实验的基因表达数据，以识别诱导特定细胞态过渡的最佳扰动；与几个基线相比，观察到所提出的因果方法可实现更好的样品效率。在这两种情况下，我们都认为因果知情的采集函数尤其优于现有标准，从而允许使用实验明显更少的最佳干预设计。

translated by 谷歌翻译

Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models

Ethan Pickering , Themistoklis P. Sapsis

分类：机器学习 | (统计)机器学习

2022-08-27

并非所有数据都相等。误导或不必要的数据可能会严重阻碍机器学习（ML）模型的准确性。当数据丰富时，可以克服误导性效果，但是在许多现实世界中，数据稀疏且获取昂贵。我们提出了一种方法，该方法大大降低了准确训练ML模型所需的数据大小，从而有可能为ML中许多新的有限数据应用程序打开大门。我们的方法提取了最有用的数据，同时忽略和省略了将ML模型误导为下等级属性的数据。具体而言，该方法消除了“双重下降”的现象，其中更多的数据导致性能较差。这种方法为ML社区带来了一些关键功能。值得注意的是，该方法自然收敛并消除了将数据集分为培训，测试和验证数据的传统需求。相反，选择度量固有地评估了测试误差。这样可以确保在测试或验证中永远不会浪费关键信息。

translated by 谷歌翻译

Discovering and forecasting extreme events via active learning in neural operators

Ethan Pickering , Stephen Guth , George Em Karniadakis , Themistoklis P. Sapsis

分类：机器学习 | (统计)机器学习

2022-04-05

社会和自然中的极端事件，例如大流行尖峰，流氓波浪或结构性失败，可能会带来灾难性的后果。极端的表征很困难，因为它们很少出现，这似乎是由良性的条件引起的，并且属于复杂且通常是未知的无限维系统。这种挑战使他们将其描述为“毫无意义”。我们通过将贝叶斯实验设计（BED）中的新型训练方案与深神经操作员（DNOS）合奏结合在一起来解决这些困难。这个模型不足的框架配对了一个床方案，该床方案积极选择数据以用近似于无限二二维非线性运算符的DNO集合来量化极端事件。我们发现，这个框架不仅清楚地击败了高斯流程（GPS），而且只有两个成员的浅色合奏表现最好； 2）无论初始数据的状态如何（即有或没有极端），都会发现极端； 3）我们的方法消除了“双研究”现象； 4）与逐步全球Optima相比，使用次优的采集点的使用不会阻碍床的性能； 5）蒙特卡洛的获取优于高量级的标准优化器。这些结论共同构成了AI辅助实验基础设施的基础，该基础设施可以有效地推断并查明从物理到社会系统的许多领域的关键情况。

translated by 谷歌翻译

Output-weighted and relative entropy loss functions for deep learning precursors of extreme events

Samuel Rudy , Themistoklis Sapsis

分类：机器学习

2021-12-01

许多科学和工程问题需要具有稀有和极端事件的准确模型。这些问题对数据驱动建模具有具有挑战性的任务，许多天真的机器学习方法无法预测或准确地量化这些事件。这种困难的一个原因是，具有极端事件的系统，根据定义，产生不平衡数据集，并且该标准损耗功能容易忽略稀有事件。也就是说，适合培训模型的良好良好度量的指标并非旨在确保对罕见事件的准确性。这项工作旨在通过考虑设计为突出异常值的损耗函数来提高回归模型的回归模型的性能。我们提出了一种新颖的损失功能，调整后的输出加权损耗，并将基于熵的损耗功能的适用性扩展到具有低维输出的系统。使用呈现极端事件的几种动态系统的案例测试所提出的功能，并显示在极端事件的预测中显着提高准确性。

translated by 谷歌翻译

Protein-Ligand Complex Generator & Drug Screening via Tiered Tensor Transform

Jonathan P. Mailoa , Zhaofeng Ye , Jiezhong Qiu , Chang-Yu Hsieh , Shengyu Zhang

分类：神经与进化计算

2023-01-03

Accurate determination of a small molecule candidate (ligand) binding pose in its target protein pocket is important for computer-aided drug discovery. Typical rigid-body docking methods ignore the pocket flexibility of protein, while the more accurate pose generation using molecular dynamics is hindered by slow protein dynamics. We develop a tiered tensor transform (3T) algorithm to rapidly generate diverse protein-ligand complex conformations for both pose and affinity estimation in drug screening, requiring neither machine learning training nor lengthy dynamics computation, while maintaining both coarse-grain-like coordinated protein dynamics and atomistic-level details of the complex pocket. The 3T conformation structures we generate are closer to experimental co-crystal structures than those generated by docking software, and more importantly achieve significantly higher accuracy in active ligand classification than traditional ensemble docking using hundreds of experimental protein conformations. 3T structure transformation is decoupled from the system physics, making future usage in other computational scientific domains possible.

translated by 谷歌翻译

Posterior Collapse and Latent Variable Non-identifiability

Yixin Wang , David M. Blei , John P. Cunningham

分类： (统计)机器学习 | 机器学习

2023-01-02

Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful representations. Existing approaches to posterior collapse often attribute it to the use of neural networks or optimization issues due to variational approximation. In this paper, we consider posterior collapse as a problem of latent variable non-identifiability. We prove that the posterior collapses if and only if the latent variables are non-identifiable in the generative model. This fact implies that posterior collapse is not a phenomenon specific to the use of flexible distributions or approximate inference. Rather, it can occur in classical probabilistic models even with exact inference, which we also demonstrate. Based on these results, we propose a class of latent-identifiable variational autoencoders, deep generative models which enforce identifiability without sacrificing flexibility. This model class resolves the problem of latent variable non-identifiability by leveraging bijective Brenier maps and parameterizing them with input convex neural networks, without special variational inference objectives or optimization tricks. Across synthetic and real datasets, latent-identifiable variational autoencoders outperform existing methods in mitigating posterior collapse and providing meaningful representations of the data.

translated by 谷歌翻译

Pseudo-Inverted Bottleneck Convolution for DARTS Search Space

Arash Ahmadian , Yue Fei , Louis S. P. Liu , Konstantinos N. Plataniotis , Mahdi S. Hosseini

分类：机器学习

2022-12-31

Differentiable Architecture Search (DARTS) has attracted considerable attention as a gradient-based Neural Architecture Search (NAS) method. Since the introduction of DARTS, there has been little work done on adapting the action space based on state-of-art architecture design principles for CNNs. In this work, we aim to address this gap by incrementally augmenting the DARTS search space with micro-design changes inspired by ConvNeXt and studying the trade-off between accuracy, evaluation layer count, and computational cost. To this end, we introduce the Pseudo-Inverted Bottleneck conv block intending to reduce the computational footprint of the inverted bottleneck block proposed in ConvNeXt. Our proposed architecture is much less sensitive to evaluation layer count and outperforms a DARTS network with similar size significantly, at layer counts as small as 2. Furthermore, with less layers, not only does it achieve higher accuracy with lower GMACs and parameter count, GradCAM comparisons show that our network is able to better detect distinctive features of target objects compared to DARTS.

translated by 谷歌翻译

Physics-informed Neural Networks approach to solve the Blasius function

Greeshma Krishna , Malavika S Nair , Pramod P Nair , Anil Lal S

分类：机器学习

2022-12-31

Deep learning techniques with neural networks have been used effectively in computational fluid dynamics (CFD) to obtain solutions to nonlinear differential equations. This paper presents a physics-informed neural network (PINN) approach to solve the Blasius function. This method eliminates the process of changing the non-linear differential equation to an initial value problem. Also, it tackles the convergence issue arising in the conventional series solution. It is seen that this method produces results that are at par with the numerical and conventional methods. The solution is extended to the negative axis to show that PINNs capture the singularity of the function at $\eta=-5.69$

translated by 谷歌翻译

Comparative Analysis of Clustering Techniques for Personalized Food Kit Distribution

Jude Francis , Rowan K Baby , Jacob Abraham , Ajmal P. S

分类：机器学习 | (统计)机器学习

2022-12-30

The Government of Kerala had increased the frequency of supply of free food kits owing to the pandemic, however, these items were static and not indicative of the personal preferences of the consumers. This paper conducts a comparative analysis of various clustering techniques on a scaled-down version of a real-world dataset obtained through a conjoint analysis-based survey. Clustering carried out by centroid-based methods such as k means is analyzed and the results are plotted along with SVD, and finally, a conclusion is reached as to which among the two is better. Once the clusters have been formulated, commodities are also decided upon for each cluster. Also, clustering is further enhanced by reassignment, based on a specific cluster loss threshold. Thus, the most efficacious clustering technique for designing a food kit tailored to the needs of individuals is finally obtained.

translated by 谷歌翻译

Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search

Wenqing Zheng , S P Sharan , Zhiwen Fan , Kevin Wang , Yihan Xi , Zhangyang Wang

分类：机器学习 | 人工智能

2022-12-30

Learning efficient and interpretable policies has been a challenging task in reinforcement learning (RL), particularly in the visual RL setting with complex scenes. While neural networks have achieved competitive performance, the resulting policies are often over-parameterized black boxes that are difficult to interpret and deploy efficiently. More recent symbolic RL frameworks have shown that high-level domain-specific programming logic can be designed to handle both policy learning and symbolic planning. However, these approaches rely on coded primitives with little feature learning, and when applied to high-dimensional visual scenes, they can suffer from scalability issues and perform poorly when images have complex object interactions. To address these challenges, we propose \textit{Differentiable Symbolic Expression Search} (DiffSES), a novel symbolic learning approach that discovers discrete symbolic policies using partially differentiable optimization. By using object-level abstractions instead of raw pixel-level inputs, DiffSES is able to leverage the simplicity and scalability advantages of symbolic expressions, while also incorporating the strengths of neural networks for feature learning and optimization. Our experiments demonstrate that DiffSES is able to generate symbolic policies that are simpler and more and scalable than state-of-the-art symbolic RL methods, with a reduced amount of symbolic prior knowledge.

translated by 谷歌翻译